FIG. 1 

BLASTP - query = 147JTR1; Hit = swiss 1 0153 93 1 TMS2_HUMAN 

This hit is scoring at : 3e-66 (expectation value) 
Alignment length (overlap) : 370 
Identities : 38 \ 

Scoring matrix : BLOSUM62 (used to infer consensus pattern) 
Database searched : nrdb 

Q : 36 CDGWDCKLKSDEL GCVRFDWDKSLLKIYSGSSHQWLPICSSNWNDSYSEKTCOOLGFES 

W T. M CVR. .. :L: :YS W P:C. . :WN: :Y. . . .C: . :G: : : 

H: 133 CDGVSHCPGGEDENRCVRLYGPNFILQMYSSQRKSWHPVCQDDWNENYGRAACRDMGYKN 

AHRTTEVAHRDFANSFS ILRYNST IQESLHRSE CPSQRYISLQCSHCGLRA 

. ::: . D :.S S.::.N:: I.:.L:.S: C.S:..:SL:C CG: .. : 
NFYSSQGIVDD SGSTSFMKLNTSAGNVDIYKKLYHSDACSSKAWSLRCLACGVNLNSS 

MTGRIVGGALASDSKWPWQVSLHFGTTHEGGTLIDAQmTAAHCFFVTREKVLEG- - - 
...RIVGG. A ...WPWQVSLK. . .H:CGG: T. . i¥i :TAAHl\ EK L. . 
RQSRIVGGESALPGAWPWQVSLHVQNVHVCGGSIITPEWVTAAHCV EKPLNNPWH' 

TRYPSIN HIS 

WKVYA CTSNLHQLPEAAS- - IAE 1 1 INSNYTDEEDDYDIALMRLSKH 7TLSGSG I CT? 
W..:AG. . . : X :.::i :.NL,:..: UI ALM : L . KPLT . : . : :C.P 
WTAFAGILRQSFMFYGAGYQVQKVISHPNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLP 

RSPAPQPQHPLQPSHLSASVNSYPGPKASIDKTSPFLREVQVNLIDFKKCNDYLVYDSYL" 

" "V " P LQP . .L . : : . : G. . . . ■ .KTS. .L. . . : V LI: ::CN. .VXD:. ; 
N PGMMLQPEQIrCWISGWGATEEKGXTSEVLNMKVLLIETQRCNSRYVYDNLI 

TPRMMCAGDLRGGRDSCQGDSGGPLVCEQNNRW YLAGVTSWGTGCGQRNKPGVYTKVTEV 
TP.M:(JAG L:G. WCQGUSGGPlV . . M W;L.G T^G:GC. : . :PGV¥ .V. . 
TPAMICAGFLQGNVDSCQGDSGGPLVTSNNNIWWLIGDTSWGSGCAKAYRPGVYGNVMVF 

TRYPSIN SER 
LPWIYSKMEA 389 " 
..W1X.:M:A 
TDWIYRQMKA 490 



FIG. 2 



Prosite search results 

PS00134 187->193 TRYPSIN JKIS PDOC00124 
PS00135 334->346 TRYPSIN SER PDOC00124 



FIG. 3 

BLOCKS search results 

AC# Description Strength Score 

BL00495N Apple domain proteins. 1945 1582 

AAi 325 AGdlrGGrDsCqGDSGGPLVCeqNnRWyLaGvTSW (SEQ ID NO: 15) 

BL01253G Type I fibronectin domain proteins. 1641 1548 

AA# 332 rDsCQGDSGGPLVC (SEQ ID NO: 16) 

BL00134A Serine proteases, trypsin family, histidine p 1500 1524 

AAi 175 CGGTLIDAQWVLTAAHC (SEQ ID NO: 17) 

BL00021D Kringle domain proteins. 1556 1510 

AAi 341 GPLVCEQNNRWYLaGVTSWGtGCGQRNKPGVYTKVTevLPWI (SEQ ID N0:18) 

BL01253H Type I fibronectin domain proteins. 1765 1508 

AAi 351 wYLaGvtSWGtGCGQRNKPGVYTKVTevLpWIysk (SEQ ID NO:19) 

BL00021B Kringle domain proteins. 1547 1507 

AAi 175 CGGTLIDaQWVLTAAHCF (SEQ ID NO: 20) 

BL00495O Apple domain oroteins. 1756 1383 

AAi 360 GtGCGQRnkPGVYTKVtEvlpWIysKmeA (SEQ ID NO:2I) 

BL00134B Serine proteases, trypsin family, histidine p 1289 1299 

AAi 333 DSCQGDSGGPLVCEqNNRWYLAGV (SEQ ID NO:22) 

BL01209 LDL-receptor class A (LDLRA) domain proteins. 1413 1274 

AAi 35 CDGWDCKIKSDE (SEQ. ID NO:23). . - 

BL01253F Type I fibronectin domain proteins. 1693 1270 

AAi 288 AdktSpFLREvQVnLidfkKCndylVYdSylTPrMmCAG (SEQ ID NO:24) 

BL00495L Apple domain proteins. 1947 1263 

AAi 209 tSnlhqlpeaaSIaEIIInsNYtdeEddYDIALmrLskP (SEQ ID NO:25) 

BL00134C Serine proteases, trypsin family, histidine p 1245 1254 

AAi 369 PGVYTKVTEVLPWI (SEQ ID NO:26) 

BL01253D Type I fibronectin domain proteins. 1398 1217 

AAi 175 CGGtLIdaqWVLTA (SEQ ID NO:27) 
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